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SEQUENCE LISTING 

(1) GENERAL IN^toRMATION: 

(i) APPLIcA^IT: Reed, John 
(ii) TITLE OR INVENTION: Regulation of bcl-2 Gene Expression 
(iii) NUMBER OR SEQUENCES: 2 3 

( iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: Gray, Gary, Ames & Frye 

(B) STREE*^ 401 B Street, Suite 1700 

(C) CITY: San Diego 

(D) STATE: CA 

(E) COUNTRY \ USA 

(F) ZIP: 92lV)l-4297 

(V) COMPUTER READ^LE FORM: 

(A) MEDIUM TYPE: Diskette 3.5 inch, 1.44 Mb storage 

(B) COMPUTER: \IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version =1.25 

(vi) CURRENT APPLICAmON DATA: 

(A) APPLICATION^ NUMBER: US 

(B) FILING DATE? 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INRORMATION: 

(A) NAME: BrotmanV Harris F. 

(B) REGISTRATION NUMBER: 35461 

(C) REFERENCE/DOCHET NUMBER: P0041US0 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (6l4) 699-3630 

(B) TELEFAX: (619) ^36-1048 

(2) INFORMATION FOR SEQ ID NO : 1 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 2 0 base plfirs 

(B) TYPE: nucleic acic 

( C ) STRANDEDNESS : s ing^ e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ VeD N0:1: 



CCCTTCCTAC CGCGTGCGAC 



20 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) lOINGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECuIe TYPE: DNA (genomic) 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCH DESCRIPTION: SEQ ID NO: 2 
CTTTTCCTCT GGGAAGGATG GCGCACGCTG GGAGA 
(2) INFORMATION FQR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE:\ nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYFE: DNA (genomic) 
(iv) ANTI-SENSE: \yES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:: 
CCTCCGACCC ATCCACGTAG 
(2) INFORMATION FOR SEQ\ID NO : 4 : 

(i) SEQUENCE CHARAcVeRISTICS: 

(A) LENGTH: 33\base pairs 

(B) TYPE: nucleUc acid 

( C) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DN^ (genomic) 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTKSN : SEQ ID NO: 4 
ACGGGGTACG GAGGCTGGGT AGGTGCATCT GGT 
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(2) INFORMATlW FOR SEQ ID NO: 5: 

(i) SEQUElk:E CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE VypE: DNA (genomic) 
(iv) ANTI-SENSe\ YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5; 
GTTGACGTCC TACGGAAACA 
(2) INFORKATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nuclWic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTl3&N : SEQ ID NO: 6: 
CCCCCAACTG CAGGATGCCT TTGTGGkACT GTACGG 
(2) INFORMATION FOR SEQ ID No\ 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base H^i^rs 

(B) TYPE: nucleic aci^ 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linea: 

(ii) MOLECULE TYPE: DNA (gendmic) 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ^D N0:7 
GGGAAGGATG GCGCACGCTG 
(2) INFORMATION FOR SEQ ID NO : 8 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) y^NGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) aTRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENteE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 
CGCGTGCGAC CCTCTTG 

(2) INFORMATION FOR ^EQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: \l7 base pairs 

(B) TYPE: nifcleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE:\dNA (genomic) 
(iv) ANTI-SENSE: YEi 



17 



(xi) SEQUENCE DESCRII^ION: SEQ ID NO: 9; 
TACCGCGTGC GACCCTC 

(2) INFORMATION FOR SEQ Id\nO:10: 

(i) SEQUENCE CHARACTEMTSTICS: 

(A) LENGTH: 17 bake pairs 

(B) TYPE: nucleiclacid 

(C) STRANDEDNESS: k ingle 

(D) TOPOLOGY: linekr 

(ii) MOLECULE TYPE: DNA |genomic) 
(iv) ANTI-SENSE: YES 

(Xi) SEQUENCE DESCRIPTION :\ SEQ ID NO: 10; 
TCCTACCGCG TGCGACC 
(2) INFORMATION FOR SEQ ID NO:i: 

(i) SEQUENCE CHARACTERISTIC^ 

(A) LENGTH: 17 base pai\rs 

(B) TYPE: nucleic acid 



17 



17 
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(C) ^TRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECuk type: DNA (genomic) 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE Description: seq id no:ii: 

CCTTCCTACC GCGTGCG 

(2) INFORMATION FOR Wq ID NO: 12: 

(i) SEQUENCE characteristics: 
^ (A) L£NGTHA 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : single 

(D) T0P0L0G"«: linear 

(ii) MOLECULE TYPe\ DNA (genomic) 
(iv) ANTI-SENSE: YElj 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:12= 
GACCCTTCCT ACCGCGT 

(2) INFORMATION FOR SEQ id NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 17 bdse pairs 

(B) TYPE: nucleic\ acid 

(C) STRANDEDNESS:\ single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA ^genomic) 
(iv) ANTI-SENSE: YES 

(Xi) SEQUENCE DESCRIPTION r^EQ ID N0:13: 
GGAGACCCTT CCTACCG 
(2) INFORMATION FOR SEQ ID NO: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base p^rs 

(B) TYPE: nucleic acidV 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear \ 



17 



17 




17 
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(ii) MOLECU^ TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(Xi) SEQUENCE \eSCRIPTION: SEQ ID N0:14: 
GCGGCGGCAG CGCGG 

(2) INFORMATION FOr\sEQ ID NO: 15: 

(i) SEQUENCE characteristics: 

(A) LENGTHi 15 base pairs 

(B) type: liucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYP^: DNA (genomic) 
(iv) ANTI-SENSE: YfeS 



15 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CGGCGGGGCG ACGGA 

(2) INFORMATION FOR SEQ CD NO: 16: 

(i) SEQUENCE CHARAcWlSTICS: 
^ (A) LENGTH: leAbase pairs 

(B) TYPE: nucleac acid 

(C) STRANDEDNESB : single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: D^ 
(iv) ANTI-SENSE: YES 



(genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:16; 
GGAGCGCG GCGGGC 
(2) INFORMATION FOR SEQ ID 



16 



10 : 17 : 



(i) SEQUENCE CHARACTERISTICS: 
^ (A) length: 18 basfe pairs 

(B) TYPE: nucleic 4=^° 

(C) STRANDEDNESS : ^mgle 

(D) TOPOLOGY.: linedr 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
TCTCCCAGCG TGCGQCAT 
(2) INFORMATION TOR SEQ ID NO: 18: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTOi: 18 base pairs 

(B) TYPE:\nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRM>TION: SEQ ID NO: 18 
TGCACTCACG CTCGGCCT \ 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(X) LENGTH: 5086 base pairs 
(b\ TYPE: nucleic acid 
(CASTRANDEDNESS: single 
(D) yroPOLOGY: linear 



(xi) SEQUENCH DESCRIPTION: SEQ ID N0:19: 



GCGCCCGCCC 


CTCCGCGCCG 


CCTGCCCGCC 


CGCCCGCCGC 


GCTCCCGCCC 


GCCGCTCTCC 


60 


GTGGCCCCGC 


CGCGCTGCCG 


CCGCCGCCGC 


TGCCAGCGAA 


GGTGCCGGGG 


CTCCGGGCCC 


120 


TCCCTGCCGG 


CGGCCGTCAG 


CGCTCGGAGC 


GAACTGCGCG 


ACGGGAGGTC 


CGGGAGGCGA 


180 


CCGTAGTCGC 


GCCGCCGCGC 


Y^GACCAGGA 


GGAGGAGAAA 


GGGTGCGCAG 


CCCGGAGGCG 


240 


GGGTGCGCCG 


GTGGGGTGCA 


qCGGAAGAGG 


GGGTCCAGGG 


GGGAGAACTT 


CGTAGCAGTC 


300 


ATCCTTTTTA 


GGAAAAGAGG 


GmAAAATAA 


AACCCTCCCC 


CACCACCTCC 


TTCTCCCCAC 


360 


CCCTCGCCGC 


ACCACACACA 


GCGCGGGCTT 


CTAGCGCTCG 


GCACCGGCGG 


GCCAGGCGCG 


420 


TCCTGCCTTC 


ATTTATCCAG 


CAGOTTTTCG 


GAAAATGCAT 


TTGCTGTTCG 


GAGTTTAATC 


480 


AGAAGACGAT 


TCCTGCCTCC 


GTCCOCGGCT 


CCTTCATCGT 


CCCATCTCCC 


CTGTCTCTCT 


540 


CCTGGGGAGG 


CGTGAAGCGG 


TCCCGTbGAT 


AGAGATTCAT 


GCCTGTGTCC 


GCGCGTGTGT 


600 


GCGCGCGTAT 


AAATTGCCGA 


GAAGGGOAAA 


ACATCACAGG 


ACTTCTGCGA 


ATACCGGACT 


660 


GAAAATTGTA 


ATTCATCTGC 


CGCCGCCGUT 


GCCAAAAAAA 


AACTCGAGCT 


CTTGAGATCT 


720 


CCGGTTGGGA 


TTCCTGCGGA 


TTGACATTTC 


TGTGAAGCAG 


AAGTCTGGGA 


ATCGATCTGG 


780 


AAATCCTCCT 


AATTTTTACT 


CCCTCTCCCcX 


CCGACTCCTG 


ATTCATTGGG 


AAGTTTCAAA 


840 


TCAGCTATAA 


CTGGAGAGTG 


CTGAAGATTG 


Vtgggatcgt 


TGCCTTATGC 


ATTTGTTTTG 


900 


GTTTTACAAA 


AAGGAAACTT 


GACAGAGGAT 


CATGCTGTAC 


TTAAAAAATA 


CAAGTAAGTC 


960 


TCGCACAGGA 


AATTGGTTTA 


ATGTAACTTT 


CAATGGAAAC 


CTTTGAGATT 


TTTTACTTAA 


1020 


AGTGCATTCG 


AGTAAATTTA 


ATTTCCAGGC 


AGCITAATAC 


ATTGTTTTTA 


GCCGTGTTAC 


1080 


TTGTAGTGTG 


TATGCCCTGC 


TTTCACTCAG 


TGTGTACAGG 


GAAACGCACC 


TGATTTTTTA 


1140 


CTTATTAGTT 


TGTTTTTTCT 


TTAACCTTTC 


AGCATc\cAG 


AGGAAGTAGA 


CTGATATTAA 


1200 


CAATACTTAC 


TAATAATAAC 


GTGCCTCATG 


AAATAAASAT 


CCGAAAGGAA 


TTGGAATAAA 


1260 


AATTTCCTGC 


GTCTCATGCC 


AAGAGGGAAA 


CACCAGAATC 


AAGTGTTCCG 


CGTGATTGAA 


1320 
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GACACCCCCTV 


CGTCCAAGAA 


TGCAAAGCAC 


ATCChATPiAA 


ATAGCTGGAT 


TATAACTCCT 


1380 


CTTCTTTCTC 


TteGGGGCCGT 


GGGGTGGGAG 


CTGGGGCGAG 


AGGTGCCGTT 


GGCCCCCGTT 


1440 


GCTTTTCCTC 


TGGfeAAGGAT 


GGCGCACGCT 


GGGAGAACGG 


GGTACGACAA 


CCGGGAGATA 


1500 


GTGATGAAGT 


ACATCCATTA 


TAAGCTGTCG 


CAGAGGGGCT 


ACGAGTGGGA 


TGCGGGAGAT 


1560 


GTGGGCGCCG 


CGCCCCCGGG 


GGCCGCCCCC 


GCACCGGGCA 


TCTTCTCCTC 


ccagcccggg 


1620 


CACACGCCCC 


ATCCAGCCqC 


ATCCCGCGAC 


CCGGTCGCCA 


GGACCTCGCC 


GCTGCAGACC 


1680 


CCGGCTGCCC 


CCGGCGCCGCa 


CGCGGGGCCT 


GCGCTCAGCC 


CGGTGCCACC 


tgtggtccac 


1740 


Vw> X w w w w Vw J. WW 


GCCAAGCCGG 


Vgacgacttc 


TCCCGCCGCT 


ACCGCGGCGA 


CTTCGCCGAG 


1800 


C\ X v3 X W W^^ w W W 


AGCTGCACCT 


GaCGCCCTTC 


ACCGCGCGGG 


GACGCTTTGC 


CACGGTGGTG 


1860 


A n A G d T CT* 


TCAGGGACGG 


GGTO5AACTGG 


GGGAGGATTG 


TGGCCTTCTT 


TGAGTTCGGT 


1920 


GGGGTCATGT 


GTGTGGAGAG 


CGTCkACCGG 


GAGATGTCGC 


CCCTGGTGGA 


CAACATCGCC 


1980 


CTGTGGATGA 


CTGAGTACCT 


GAACC^CAC 


CTGCACACCT 


GGATCCAGGA 


TAACGGAGGC 


2040 


TG GG AT GCCT 


TTGTGGAACT 


GHACQQC^C 


AGCATGCGGC 


CTCTGTTTGA 


TTTCTCCTGG 


2100 


rTGTCTCTGA 

W X w X W X W X 


AGACTCTGCT 


CAGTTTGGOC 


CTGGTGGGAG 


CTTGCATCAC 


CCTGGGTGCC 


2160 


X A X W X VSrt wW w 


A C AAGTG AAG 


TCAACATGCC 


^GCCCCAAAC 


AAATATGCAA 


AAGGTTCACT 


2220 


A A AGPAGTAG 


AAAT AAT AT G 

X^X^^^ X «x»x X X 


CATTGTCAGT 


GATGTACCAT 


GAAACAAAGC 


TGCAGGCTGT 


2280 


TTAAGAAAAA 


ATAACACAXA 


TATAAACATC 


ACACACACAG 


ACAGACACAC 


ACACACACAA 


2340 


CAATTAACAG 


TCTTCAGGCA 


AAACGTCGAA 


TCAGLCTATTT 


ACTGCCAAAG 


GGAAATATCA 


2400 


TTTATTTTTT 


ACATTATTAA 


GAAAAAAGAT 


TTATTOATTT 


AAGACAGTCC 


CATCAAAACT 


2460 


CCGTCTTTGG 


AAATCCGACC 


ACTAATTGCC 


AAACAcWcT 


TCGTGTGGCT 


CCACCTGGAT 


2520 


VJX XWXVJXwWW 


TGTAAACATA 


GATTCGCTTT 


CCATGTTOTT 


GGCCGGATCA 


CCATCTGAAG 


2580 


AGPAGAPGGA 


TGGAAAAAGG 


ACCTGATCAT 


TGGGGAAGC'^ 


GGCTTTCTGG 


CTGCTGGAGG 


2640 




GGTGTTCATT 

^7^7 X \S X X X X 


CACTTGCATT 


TCTTTGCCCT 


TSGGGGCGTGA 


TATTAACAGA 


2700 


GGGAGGGTTC 


CCGTGGGGGG 


AAGTCCATGC 


CTCCCTGGCC 


TO^GAAGAG 


ACTCTTTGCA 


2760 


TATGACTCAC 


ATGATGCATA 


Uw X vjVj 1 tjVjijA 


GGAAAAGAGT 


TGGGAACTTC 


AGATGGACCT 


2820 


AGTACCCACT 


GAGATTTCCA 


CGCCGAAGGA 


CAGCGATGGG 


aaaaXtgccc 


TTAAATCATA 


2880 


GGAAAGTATT 


TTTTTAAGCT 


ACCAATTGTG 


CCGAGAAAAG 


CATTTTAGCA 


ATTTATACAA 


2940 


TATCATCCAG 


TACCTTAAAC 


CCTGATTGTG 


TATATTCATA 


TATTTTGCAT 


ACGCACCCCC 


3000 
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tSia ptgg ct ct 


GTCTGAGTAA 


GAAACAGAAT 


CCTCTGGAAC 


TTGAGGAAGT 


3060 




VJ X ^X***^ ^ * 


ATCAGGAAGG 


CTAGAGTTAC 


CCAGAGCATC 


AGGCCGCCAC 


3120 




XXX XnOwAwn 


CCGAAGTCCG 


CAGAACCTAC 


CTGTGTCCCA 


GCTTGGAGGC 


3180 






GGCCCTCACT 


GGCCTCCTCC 


AGGGATGATC 


AACAGGGTAG 


3240 




r:A ATGTnfTGG 


AAGCTGATGG 


ATGGAGCTCA 


GAATTCCACT 


GTCAAGAAAG 


3300 




GGGTGTGGCT 

X V3 X wwV^ X 


GGGCCTGTCA 


CCCTGGGGCC 


CTCCAGGTAG 


GCCCGTTTTC 


3360 




TArZ/^AGPPA A 
X Av»w*Wj ^ v.»r\v— \ 


GACCCTTCTT 


AAGACATGTA 


TCACTGTAGA 


GGGAAGGAAC 


3420 


AGAGGCCCTG 


ijtjv^^ X X ^Vi* X A 


T5PAGAAGGAP 


ATGGTGAAGG 


CTGGGAACGT 


GAGGAGAGGC 


3480 


AATGGCCACG 


Q7L.L.^AX X X XVj 


GrV'GTAGPAP 


ATGGCACGTT 


GGCTGTGTGG 


CCTTGGCCAC 


3540 


CTGTGAGTTT 


AAAG L. AA^j C 


X X XavvaXVjav* 


TTTGGAGAGG 


GTCACAAATC 


CTAAAAGAAG 


3600 


CATTGAAGTG 


Atjvj X Vj X V- A X \J 


GATT2^TTGA 

OaX XAAX X \.» a 


CCCCTGTCTA 


TGGAATTACA 


TGTAAAACAT 


3660 


TA 1 C i i G i U A 


L« X O X AL? X X X o 


GTTTTA^T'TG 

w X X X X c\ J\» ^ ^ 


AAAACCTGAC 


AAAAAAAAAG 


TTCCAGGTGT 


3720 


GGAAi AxGGG 


r* P TT A T PT T 
(jVjX XAX^^XOX 


APATCPTC!?GG 


GCATTAAAAA 


AAAATCAATG 


GTGGGGAACT 


3780 


AT AAAGAAG i 


A Ar^A AAA^^AA 
AAV-AAAAVjAA 


\3 X w A>» A X W X X 


GAG C AAAT AA 


ACTAGGAAAT 


TTTTTTTT CT 


3840 


TCCAGi i lAG 


A AT'PAPPPT'T' 
AAX wAu^Vv X X 


GA A APATTGA 

VJ AAA WAX X w A 


VpGGAATAACT 


CTGTGGCATT 


ATTGCATTAT 


3900 


AiAuUAl 1 lA 


T'/^T'PTAT'T' A A 
X L. X X AX X AA 


PT'T'TGGAATG 

^ X X X \J\J£^^\ X w 


TACTCTGTTC 


AATGTTTAAT 


GCTGTGGTTG 


3960 


AlAl i lUtjAA 


A /^P'T'nPT'T'T A 
A^jw X X X X A 


AAAAAATACA 


TGOATCTCAG 


CGTTTTTTTG 


TTTTTAATTG 


4020 


1 A i i X AO 1 1 A 


TnnPPTATAC 

X WW V— X.A X A^ 


ACTATTTGTG 

Aw X XXX w M w 


AGCM^GGTG 


ATCGTTTTCT 


GTTTGAGATT 


4080 


1 1 1 AX U X U X X 


P ATTPTTPA A 

X X X X AA 


AAGCATTCTG 


AGAAGOTGAG 


ATAAGCCCTG 


AGTCTCAGCT 


4140 


AC C i AAG AAA 


A A ppTnr; ATrz 


TPAPTGGCCA 


CTGAGGAqJCT 


TTGTTTCAAC 


CAAGTCATGT 


4200 


GCATTTCCAC 


I^^PPA APAf^AA 
X U-AAv-ALjAA 


X X O X X X ^\ X X 


TGACAGTTM? 


ATCTGTTGTC 


CCTTTGACCT 


4260 


TGTTTCTTGA 


AvjtaX X X X ^- 


\j X www X www W 


AATTCCGCAT^ 


^TTAATTCATG 


GTATTCAGGA 


4320 


TTACATGCAT 


G i i i Gvj X X AA 


APPP ATG AGA 

A^^WAX V3AVJA 


TTCATTCAGT 

i i W>«1 Ju X W>*^^J ^ 


T^AAAAATCCA 


GATGGCGAAT 


4380 


GAG C AG C AG A 


X X L. AAA X C X A 


TGGTGGTTTG 

X wTvJ X VJO X X X w 


ACCTTTAGAG 


AG^TGCTTTA 


CGTGGCCTGT 


4440 


TTCAACACAG 


ACCCACCCAG 


AGCCCTCCTG 


/-* rn ^ rn rn 


ppgAgggptt 

wwwvj%jruww X X 


TCTCATGGCT 


4500 


GTCCTTCAGG 


GTCTTCCTGA 


AATGCAGTGG 


TCGTTACGCT 


ccaccVagaa 


AGCAGGAAAC 


4560 


CTGTGGTATG 


AAGCCAGACC 


TCCCCGGCGG 


GCCTCAGGGA 


ACAGAAT^AT 


CAGACCTTTG 


4620 


AATGATTCTA 


ATITTTAAGC 


AAAATATTAT 


TTTATGAAAG 


gtttacat^g 


TCAAAGTGAT 


4680 



GAATATGGAA 
TTGCAGTATG 
GTGGACGTTT 
GAGTATTTGA 
CTTTTGCTGT 
CCCCAGAACT 
CCTTATTGTT 



TATCXAATCC 
CTCCACl5;rGG 
TTAATAT 
AAAATGTATA 
GGGGTTTTGT 
GTACAGTATT 
AAAAACATGT 



TGTGCTGCTA 
TAAGATCCTC 
GCCTGTTTTG 
'ATATTAAGA 
TACCTGGTTT 

gtg\ctgcac 
tagaaocaat 
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TCCTGCCAAA 
CAAGCTGCTT 
TCTTTTGTTG 
GGTCACGGGG 
TAATAACAGT 
TTGCTCTAAG 
GAATGTATAT 



ATCATTTTAA 
TAGAAGTAAC 
TTGTTCAAAC 
GCTAATTGCT 
AAATGTGCCC 
AGTAGTTGAT 
AAAAGC 



TGGAGTCAGT 
AATGAAGAAC 
GGGATTCACA 
AGCTGGCTGC 
AGCCTCTTGG 
GTTGCATTTT 



4740 
4800 
4860 
4920 
4980 
5040 
5086 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 717 base pairs 
(BKTYPE: nucleic acid 

(C) ^TRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECuA TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/I 

(B) LOCATIC 



CDS 

1. .717 



(xi) SEQUENCE DESCr\pTI0N: SEQ ID NO: 20: 



ATG GCG CAC GCT GGG AGA 
Met Ala His Ala Gly Arg 
1 5 

AAG TAC ATC CAT TAT AAG 
Lys Tyr lie His Tyr Lys 
20 

GGA GAT GTG GGC GCC GCG 
Gly Asp Val Gly Ala Ala 
35 

TTC TCC TCC CAG CCC GGG 
Phe Ser Ser Gin Pro Gly 
50 

CCG GTC GCC AGG ACC TCG 
Pro Val Ala Arg Thr Ser 
65 70 

GCC GCG GGG CCT GCG CTC 
Ala Ala Gly Pro Ala Leu 

85 



GGG TAC GAC AAC CGG 

ThK ^^y '^y^ 

^ 10 

CTG T^SG CAG AGG GGC TAC 
Leu Se\ Gin Arg Gly Tyr 
^ 25 

CCC CCG cteG GCC GCC CCC 
Pro Pro GlSc Ala Ala Pro 
40 



CAC ACG CCC C?AT CCA GCC 
His Thr Pro Hre Pro Ala 
55 \ ^0 



GAG ATA GTG ATG 
Glu lie Val Met 
15 

GAG TGG GAT GCG 
Glu Trp Asp Ala 
30 

GCA CCG GGC ATC 
Ala Pro Gly He 
45 

GCA TCC CGC GAC 
Ala Ser Arg Asp 



CCG CTG CAG ACC ^G GCT 
Pro Leu Gin Thr Pl^ Ala 



CTC CGC CAA GCC GGC GAC 
Leu Arg Gin Ala Gly Asp 
100 

GCC GAG ATG TCC AGC CAG 
Ala Glu Met Ser Ser Gin 
115 

CGC TTT GCC ACG GTG GTG 
Arg Phe Ala Thr Val Val 
130 



AGC CCG GTG CCA CCT 
Ser Pro Val Pro Pro 
90 

GAC TTC TCC CGC CGC TAcN 
Asp Phe Ser Arg Arg Tyr 
105 

CTG CAC CTG ACG CCC TTC 
Leu His Leu Thr Pro Phe 
120 

GAG GAG CTC TTC AGG GAC 
Glu Glu Leu Phe Arg Asp 
135 



GCC CCC GGC GCC 
Ala Pro Gly Ala 
80 

GTC CAC CTG GCC 
Val His Leu Ala 
95 

GGC GAC TTC 
Gly Asp Phe 
110 

GCG CGG GGA 
aa Arg Gly 



ACC 
Thr 
125 

GGG 

Gly 



GTG AAC TGG 
Va^Asn Trp 



48 



96 



144 



192 



240 



288 



336 



3S4 



432 
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GGG AGG ATT GTG TTC TTT GAG TTC GGT GGG GTC ATG TGT GTG GAG 

Gly Arg lie Val Alsa Phe Phe Glu Phe Gly Gly Val Met Cys Val Glu 
145 \ 150 155 160 



480 



AGC GTC AAC CGG GAG 
Ser Val Asn Arg Glu 

165 



TCG CCC CTG GTG GAG AAC ATC GCC CTG TGG 
Ser Pro Leu Val Asp Asn lie Ala Leu Trp 
170 175 



528 



ATG ACT GAG TAC CTG AAC C(5G CAC CTG CAC ACC TGG ATC CAG GAT AAC 
Met Thr Glu Tyr Leu Asn Args. His Leu His Thr Trp lie Gin Asp Asn 
180 \ 185 190 



576 



GGA GGC TGG GAT GCC TTT GTG GAA CTG TAC GGC CCC AGC ATG CGG CCT 
Gly Gly Trp Asp Ala Phe Val GliJv Leu Tyr Gly Pro Ser Met Arg Pro 
195 200\ 205 



624 



CTG TTT GAT TTC TCC TGG CTG TCT CTi 
Leu Phe Asp Phe Ser Trp Leu Ser Leu 
210 215 



AAG ACT CTG CTC AGT TTG GCC 
jys Thr Leu Leu Ser Leu Ala 
220 



672 



CTG GTG GGA GCT TGC ATC ACC CTG GGT GCCNTAT CTG AGC CAC AAG 
Leu Val Gly Ala Cys lie Thr Leu Gly Ala Tiyr Leu Ser His Lys 
225 230 2: 



717 
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(2) INFORMATft^ FOR SEQ ID N0:21: 

(i) SEQuWcE CHARACTERISTICS: 

(A) VengtH: 239 amino acids 

(B) SyPE: amino acid 
(D) TOCOLOGY: linear 

(ii) MOLECULE IJYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Met Ala His Ala Gly Arg Vhr Gly Tyr Asp Asn Arg Glu He Val Met 

5 \ 10 - 15 

Lys Tyr He His Tyr Lys Le\ ser Gin Arg Gly Tyr Glu Trp Asp Ala 

^ 25 30 

Gly Asp val Gly Ala Ala Pro \ro Gly Ala Ala Pro Ala Pro Gly He 
-'^ MO 45 



Phe Ser Ser Gin Pro Gly His 
50 55 



ThJ 



Pro His Pro Ala Ala Ser Arg Asp 
60 



Pro val Ala Arg Thr Ser Pro Leu \ln Thr Pro Ala Ala Pro Gly Ala 

^° ^ 75 80 

Ala Ala Gly Pro Ala Leu Ser Pro Va\ Pro Pro Val Val His Leu Ala 

85 \ 90 95 

Leu Arg Gin Ala Gly Asp Asp Phe Ser ^g Arg Tyr Arg Gly Asp Phe 

105 >y^ 

Ala Glu Met Ser Ser Gin Leu His Leu ThrVro Phe Thr Ala Ara Gly 

120 \ 125 ' ^ 

Arg Phe Ala Thr Val Val Glu Glu Leu Phe ArJ 
130 135 



Asp Gly Val Asn Trp 

a4o 



Gly Arg He Val Ala Phe Phe Glu Phe Gly Gly kl Met Cys Val Glu 

155 \ 160 

ser val Asn Arg Glu Met Ser Pro Leu Val Asp AsAlle Ala Leu Trp 

165 170 \ ;l75 

Met Thr Glu Tyr Leu Asn Arg His Leu His Thr Trp l\ Gin Asp Asn 
180 185 \l90 

Gly Gly Trp Asp Ala Phe Val Glu Leu Tyr Gly Pro Ser Ara Pro 

1^5 200 205 

Leu Phe Asp Phe Ser Trp Leu Ser Leu Lys Thr Leu Leu Se^Leu Ala 
210 215 220 
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Leu Val Gly Ala 
225 



s lie Thr Leu Gly Ala Tyr Leu Ser His Lys 

235 



230 

(2) INFORMATION FORXsEQ ID NO: 22: 

(i) SEQUENCE CHAELACTERISTICS : 

(A) LENGTH :\ 615 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY :\ linear 

(ii) MOLECULE TYPE: Wa (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: l..>ei5 



(xi) SEQUENCE DESCRIPTION^ SEQ ID NO: 22; 



ATG GCG CAC GCT GGG AGA ACG 

Met Ala His Ala Gly Arg Thr 
1 5 

AAG TAC ATC CAT TAT AAG CTG 

Lys Tyr lie His Tyr Lys Leu 
20 

GGA GAT GTG GGC GCC GCG CCC 

Gly Asp Val Gly Ala Ala Pro 
35 

TTC TCC TCC CAG CCC GGG CAC 

Phe Ser Ser Gin Pro Gly His 

50 55 

CCG GTC GCC AGG ACC TCG CCG 

Pro Val Ala Arg Thr Ser Pro 
65 70 

GCC GCG GGG CCT GCG CTC AGC 
Ala Ala Gly Pro Ala Leu Ser 

85 



GG\ TAC GAC AAC 
Glv\Tvr Asp Asn 
10 

TCG c\G AGG GGC 
Ser G]^ Arg Gly 

2: 

CCG GGG ^CC GCC 
Pro Gly ^a Ala 
40 

ACG CCC CAlX CCA 
Thr Pro HisV^^o 



CTG CAG ACC CC 
Leu Gin Thr 



75' 



CCG GTG CCA CCT 
Pro Val Pro Pro 
90 



CGG GAG ATA GTG ATG 
Arg Glu lie Val Met 
15 

TAC GAG TGG GAT GCG 
Tyr Glu Trp Asp Ala 
30 

CCC GCA CCG GGC ATC 
Pro Ala Pro Gly lie 

45 

GCC GCA TCC CGC GAC 
Ala Ala Ser Arg Asp 
60 

GCT GCC CCC GGC GCC 
Ala Ala Pro Gly Ala 

80 

GTC CAC CTG GCC 
Val His Leu Ala 
95 



48 



96 



144 



192 



240 



288 
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CTC CGC 
Leu Arg 



GCC GAG 
Ala Glu 



CGC TTT 
Arg Phe 
130 

GGG AGG 
Gly Arg 
145 

AGC GTC 
Ser Val 



:aa GCC 
Ln Ala 
100 



GGC GAC GAC 
Gly Asp Asp 



AGC CAG CTG 
Ser Gin Leu 



GCC ACG\ 
Ala Thr 



ATT GTG 
He Val 



AAC CGG 
Asn Arg 



GTG GAG 
Val Glu 
135 

TTC TTT 
^Phe Phe 
L50 

GAG 
Glu 
165 



TTC TCC 
Phe Ser 
105 

CAC CTG 
His Leu 
120 

GAG CTC 
Glu Leu 



CGC CGC 
Arg Arg 



TAC CGC 
Tyr Arg 



ACG CCC 
Thr Pro 



TTC AGG 
Phe Arg 



GAG TTC 
Glu Phe 



CCC CTG 
Pro Leu 



GAG TAC 
Glu Tyr 
180 

TGG GTA 
Trp Val 
195 



CTG 
Leu 



AAC 
Asn Ar^ 



GGT GCA TCT 
Gly Ala Ser 



CAC CTG 
His Leu 
135 

aT GAT 
Asp 



GGT GGG 
Gly Gly 
155 

GTG GAC 
Val Asp 
170 

CAC ACC 
His Thr 



TTC ACC 
Phe Thr 
125 

GAC GGG 
Asp Gly 
140 

GTC ATG 
Val Met 



AAC ATC 
Asn He 



TGG ATC 
Trp He 



GTG AGT 
Val Ser 



CTG GGC 
Leu Gly 
205 




GGC GAC TTC 
Gly Asp Phe 
110 

GCG CGG GGA 
Ala Arg Gly 



GTG AAC TGG 
Val Asn Trp 



TGT GTG GAG 
Cys Val Glu 
160 

GCC CTG TGG 
Ala Leu Trp 

175 

CAG GAT AAC 
Gin ASD Asn 
190 



ATG ACT 
Met Thr 



GGA GGC 
Gly Gly 



(2) INFORMATION FOR SEQ ID NO: 23 

(i) SEQUENCE CHARACTERISTIC^ 

(A) LENGTH: 205 amino Vcids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ IdVo:23 

Met Ala His Ala Gly Arg Thr Gly Tyr Asp A^ Arg Glu He Val Met 
^ 5 10 \ 15 

Lys Tyr He His Tyr Lys Leu Ser Gin Arg Gly \yr Glu Trp Asp Ala 
20 25 \ 30 \ 

Gly Asp Val Gly Ala Ala Pro Pro Gly Ala Ala Pro^Ala Pro Glv He 
3 5 4 0 \45 " 

Phe Ser Ser Gin Pro Gly His Thr Pro His Pro Ala APa Ser Ara Asp 
50 55 60 

Pro Val Ala Arg Thr Ser Pro Leu Gin Thr Pro Ala Ala ^o Glv Ala 

70 75 \ * 80 



336 



384 



432 



480 



528 



576 



615 



Ala Ala Gl 

Leu Arg Gin 

Ala Glu Met 
115 

Arg Phe Ala 
130 

Gly Arg He 
145 

Ser Val Asn 
Met Thr Glu 



Gly Gly Trp 
195 



Pro Val 



Phe Ser 
105 
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Pro Pro Val Val His 
90 



Leu Ala 
95 



Tyr Leu 
180 

Val Gly 



Gly X^p 
200 



Arg Arg Tyr Arg Gly Asp Phe 

' 110 

Thr Pro Phe Thr Ala Arg Gly 
125 

Phe Arg Asp Gly Val Asn Trp 
140 

Gly Gly Val Met Cys Val Glu 
155 160 

Val Asp Asn He Ala Leu Trp 
170 175 

His Thr Trp He Gin Asp Asn 
190 

Val Ser Leu Gly 
205 



